Rank in Wordlist | Frequency | Word |
---|---|---|
3496 | 48 | 1,5 |
4955 | 34 | 2,5 |
9183 | 18 | 3,5 |
11525 | 14 | 1,2 |
13203 | 12 | 1,1 |
13228 | 12 | 4,5 |
14274 | 11 | 12,7 |
14287 | 11 | 5,8 |
15483 | 10 | 0,5 |
15484 | 10 | 1,7 |
Rank in Wordlist | Frequency | Word |
---|---|---|
83714 | 1 | .) |
Rank in Wordlist | Frequency | Word |
---|---|---|
4362 | 39 | 50% |
5799 | 29 | 10% |
5804 | 29 | 20% |
6237 | 27 | 70% |
6452 | 26 | 80% |
7260 | 23 | 60% |
7594 | 22 | 1% |
7602 | 22 | 25% |
7603 | 22 | 30% |
8360 | 20 | 40% |
Rank in Wordlist | Frequency | Word |
---|---|---|
9252 | 18 | R&B |
28785 | 5 | P&G |
55397 | 2 | A&M |
57233 | 2 | D&D |
58427 | 2 | G&L |
62774 | 2 | P&G-a |
63788 | 2 | R&B/Hip-Hop |
64263 | 2 | S&T |
89173 | 1 | AD&D |
90535 | 1 | Allen&Unwin |
Rank in Wordlist | Frequency | Word |
---|---|---|
60090 | 2 | Ke$ha |
115159 | 1 | Ke$hina |
145610 | 1 | Tydolla$ign |
Rank in Wordlist | Frequency | Word |
---|---|---|
265 | 395 | ." |
Rank in Wordlist | Frequency | Word |
---|---|---|
8750 | 19 | Don't |
11571 | 14 | Can't |
13202 | 12 | .' |
14381 | 11 | I'm |
24589 | 6 | O'Brien |
27817 | 5 | Ain't |
28426 | 5 | King's |
28721 | 5 | O'Neal |
29303 | 5 | What's |
29609 | 5 | dell'arte |
Rank in Wordlist | Frequency | Word |
---|---|---|
83888 | 1 | 1+1 |
85903 | 1 | 2+2 |
86708 | 1 | 2B+D |
86729 | 1 | 3+2 |
95601 | 1 | Bosch+Bosch |
96925 | 1 | CD+G |
123661 | 1 | Mitch+5 |
145799 | 1 | UTC+12 |
145800 | 1 | UTC+14 |
145801 | 1 | UTC+8 |
Rank in Wordlist | Frequency | Word |
---|---|---|
1934 | 82 | km/h |
4159 | 41 | i/ili |
10495 | 16 | m/s |
15485 | 10 | 1/3 |
17453 | 9 | br.47/62 |
18827 | 8 | 2/3 |
20671 | 8 | stan/km² |
21047 | 7 | 2006/07 |
24453 | 6 | M/T |
25969 | 6 | m³/s |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots